
HUMAN TELOMERASE 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAG 60 
MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu 20 



GTGCTGCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 120 
ValLeuProLeuAlaThrPheValArgArgLeuGlyProGlnGlyTrpArgLeuValGln 40 



CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 180 
ArgGlyAspProAlaAlaPheArgAlaLeuValAlaGlnCysLeuValCysValProTrp 60 



GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTG 240 
AspAlaArgProProProAlaAlaProSerPheArgGlnValSerCysLeuLysGluLeu 80 



GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 300 
ValAlaArgValLeuGlnArgLeuCysGluArgGlyAlaLysAsnValLeuAlaPheGly 100 



TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 360 
PheAlaLeuLeuAspGlyAlaArgGlyGlyProProGluAlaPheThrThrSerValArg 120 



AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 420 

SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 140 

TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 480 

LeuArgArgValGlyAspAspValLeuValHisLeuLeuAlaArgCysAlaLeuPheVal 160 



CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 540 
LeuValAlaProSerCysAlaTyrGlnValCysGlyProProLeuTyrGlnLeuGlyAla 180 



GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 600 

AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 200 

CGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGT 660 

ArgAlaTrpAsnHisSerValArgGluAlaGlyValProLeuGlyLeuProAlaProGly 220 

GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 720

AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 240




GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC 780 

GlyAlaAlaProGluProGluArgThrProValGlyGlnGlySerTrpAlaHisProGly 260 

AGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 840 

ArgThrArgGlyProSerAspArgGlyPheCysValValSerProAlaArgProAlaGlu 280 

GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 900 

GluAlaThrSerLeuGluGlyAlaLeuSerGlyThrArgHisSerHisProSerValGly 300 



CGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 960 

ArgGlnHisHisAlaGlyProProSerThrSerArgProProArgProTrpAspThrPro 320 

TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1020

CysProProValTyrAlaGluThrLysHisPheLeuTyrSerSerGlyAspLysGluGln 340 

CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAlaArgArgLeu 360 



GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1140
ValGluThrIlePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380



CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1200

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400

GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACC 1260

AlaGlnCysProTyrGlyValLeuLeuLysThrHisCysProLeuArgAlaAlaValThr 420

CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1320 

ProAlaAlaGlyValCysAlaArgGluLysProGlnGlySerValAlaAlaProGluGlu 440 



GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 
GluAspThrAspProArgArgLeuValGlnLeuLeuArgGlnHisSerSerProTrpGln 460 



GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 
ValTyrGlyPheValArgAlaCysLeuArgArgLeuValProProGlyLeuTrpGlySer 480 



AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500
ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPheIleSerLeuGlyLysHis 500



Fig. 1B




GCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGGCTGCGCTTGGCTG 1560

AlaLysLeuSerLeuGlnGluLeuThrTrpLysMetSerValArgAspCysAlaTrpLeu 520

CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATC 1620

ArgArgSerProGlyValGlyCysValProAlaAlaGluHisArgLeuArgGluGluIle 540

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1680

LeuAlaLysPheLeuHisTrpLeuMetSerValTyrValValGluLeuLeuArgSerPhe 560 



TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1740

PheTyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerVal 580

TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1800

TrpSerLysLeuGlnSerIleGlyIleArgGlnHisLeuLysArgValGlnLeuArgGlu 600



CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860
LeuSerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArg 620 



CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1920

LeuArgPheIleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValVal 640

GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1980 

GlyAlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAla 660 

CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 

LeuPheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerVal 680

CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100

LeuGlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGln 700 

GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC 2160
AspProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrIle 720

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220
ProGlnAspArgLeuThrGluValIleAlaSerIleIleLysProGlnAsnThrTyrCys 740

GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280
ValArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLys 760

Fig. 1C

AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340
SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780 



CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

GlnGluThrSerProLeuArgAspAlaValValIleGluGlnSerSerSerLeuAsnGlu 800

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

AlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHisAlaValArgIle 820



AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrValGlnCysGlnGlyIleProGlnGlySerIleLeuSerThrLeu 840

CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580 

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlyIleArgArgAsp 860



GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 



CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrValValAsnPheProValGluAspGluAlaLeuGlyGlyThrAlaPheVal 920 

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 

GAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880 

GluValGlnSerAspTyrSerSerTyrAlaArgThrSerIleArgAlaSerLeuThrPhe 960

AACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAlaGlyArgAsnMetArgArgLysLeuPheGlyValLeuArgLeu 980 

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060

IleTyrLysIleLeuLeuLeuGlnAlaTyrArgPheHisAlaCysValLeuGlnLeuPro 1020

Fig. 1D



TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 
PheHisGlnGlnValTrpLysAsnProThrPhePheLeuArgValIleSerAspThrAla 1040



TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180
SerLeuCysTyrSerIleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060



GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 
AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080 



AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 
LysLeuThrArgHisArgValThrTyrValProLeuLeuGlySerLeuArgThrAlaGln 1100



ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 
ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1120 



CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACtgatggccacccgcccacagccag 3420 

ProAlaLeuProSerAspPheLysThrIleLeuAsp 1132
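
The translation above is printed in three-letter residue codes, while the later sheets use one-letter codes. As a minimal sketch for comparing the two (the helper name `three_to_one` is an illustrative assumption, not part of the figures), a printed translation line can be converted as follows; a chunk that is not a valid code, such as an OCR-damaged `lle`, raises `KeyError`:

```python
# Three-letter to one-letter amino acid codes; "Ter" marks a stop.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Ter": "*",
}


def three_to_one(line):
    """Convert a run of three-letter residue codes to one-letter codes."""
    text = "".join(line.split())  # drop stray whitespace inside the line
    if len(text) % 3 != 0:
        raise ValueError("length is not a multiple of three")
    # Unknown chunks raise KeyError, which flags damaged residue codes.
    return "".join(THREE_TO_ONE[text[i:i + 3]] for i in range(0, len(text), 3))


# Residues 1-20 as printed on the first translation line.
FIRST_LINE = "MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu"
print(three_to_one(FIRST_LINE))  # MPRAPRCRAVRSLLRSHYRE
```

The residue counter at the end of each printed line is not part of the input; strip it before calling the helper.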

gccgagagcagacaccagcagccctgtcacgccgggctctacgtcccagggagggagggg 3480

cggcccacacccaggcccgcaccgctgggagtctgaggcctgagtgagtgtttggccgag 3540

gcctgcatgtccggctgaaggctgagtgtccggctgaggcctgagcgagtgtccagccaa 3600 

gggctgagtgtccagcacacctgccgtcttcacttccccacaggctggcgctcggctcca 3660 

ccccagggccagcttttcctcaccaggagcccggcttccactccccacataggaatagtc 3720 

catccccagattcgccattgttcacccctcgccctgccctcctttgccttccacccccac 3780 

catccaggtggagaccctgagaaggaccctgggagctctgggaatttggagtgaccaaag 3840

gtgtgccctgtacacaggcgaggaccctgcacctggatgggggtccctgtgggtcaaatt 3900 

ggggggaggtgctgtgggagtaaaatactgaatatatgagtttttcagttttgaaaaaaa 3960 
Euplotes 1 - ME VO ^DNQADNHG I HS4 L KTC EE I KEAXTZ. YSWIQK V I RC/? - - N0SQ5H YKdLlDl \

HT1 1 RRLGPQGWRLVQRGDPAAFRALVAQCLl/CVPWDAR-PPP/APSFRQVSCU(ELVARVLQRLCD?GAKNVLAFGFA/./.DGA
EST2 i MX IZ.FEFIODKLD ID - - L(7TNST Wf EMLKCG 

Euplotes 56 IFAQTNIVA7TRDYNEEDFKVWRK EVFSTGLMIEIMCLVE/./.SSSDVSDRQKLQCFGFGL'<GIVC-L i '< 

HT1 80 RGGPPEAFT IS V RS YL/WTVTQAZ RGSGAWGLLLRRVGDDVLVHLLARCALFV/.VAPSCAY - - - QVCGPPLYCLGAATQA 

EST2 30 HFNGLDEILTT-CFALfWSRKMP CLPSDLSHKAV7DHCIIY/./.TGELYNN — VLTFfiYKIARiVEDVNN 

Euplotes 126 THLLT/lLSTQKQYFFQDEWWQVRAMI6NELFRHiYTKJ'LIF(?RTSEGTLVOFC6A/NVFDHLKVNDKFDKKQKGGAADMNE 

HT1 157 RPPP/MSGPRRRLGCEMWWHSVREA6VPLGLPAPGARRRGGSASRSLPLPKRPR/?GAAPEPERTPI^GQGSWAHPGRTRG 

EST2 97 SL FCWS AN V NVT/LL KGMW KMFHS LVGTY AFVDi L I N /TV I (?FNGQ - F FTC I VfflVKCNEPHLPPKWI/QRSSSS 

Euplotes 206 PRCC5TCKYNVXNEKDHFLA/NI - NVPNWNNMKSRTRi FVCTHFA/RNNOFF

HT1 237 PSDRGFCVVSPARPAEEArSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAE I XHFi. KSSGDK- - EOLR

EST2 169 SATAAQIKQLTFPVTW - XQF/.HKLN1W-SSSFF 

Euplotes 255 KKHEFVSNKNN7SAM-DRAQ77FTNI F/?FN/?IRKKIXDKV I EKIAYM/. EKV/CDFWFNY YiLTKSCF/. PFNWRE 

HT1 315 PSFLLSSLRPSLT(SARR/.VfnFLGSRPWMPGTPR/?/.P/?lJ'(?RY-WQMRPLFLE/./.GNWAQCPy'GVL/.KTHCF/.RAAVTP

EST2 200 PYSOLPSSSSIKKLTD/.RFA7FP TNLVK/fWiXVRINLTLQKLLKRWfRUtfyVSUNSICPPLEGT- - 

Telomerase domain 

Euplotes 326 RK QXIENLINKT/?££KS--KYYEEZFSYTTDNKCin'QFJNEFFYNIZ.WDFLTGR-A/RKNFQXKVXXYVEZ.NXWE 

HT1 394 AAGVCAREXPQGSVMPE££DTDPRRLVQLLRQHSSPWQI/YGFVRACLRRIVPPGLVWS/WWERRFLRAn"XXFIS/.GXWA

EST2 268 VLDLSHLSKQ 5PKE RI/LKFJIVIZ.QKLLPQEMFGSKI0VKGKI IXWLNLLL5f.PLNG 

Euplotes 398 L I HKNLLLEXI NT/?E I S(/MQVET - SAKHF Y YFDHEiV I Y VZ.WXLLRWIFEDZ. /VSZ.iRCFFV'I^T£ 1 QQICSYSKTY /V7?XN I W 

HT1 474 KZSLQELTWKMSVRW)AtfiRRSPGVGCW

EST2 324 Yf. PFDSLf-KXLRLKgFR^Z. F 1 5D - IWFTKHNFENZ.jV-QLAICFISyi.FRQf. I PKI I QTFF/C F£ I S - 5TVT I V yF/?HDTIV 

Motif 1 Motif 2 

Euplotes 477 DV IMKMSJADLKX- ETZ.A£VQ£K£VEEWKKSL -GFAPGXZ./?LJPXXTT - -FflPIMTFNKK I VNSDRK - - TTXLTTNTKLL 

HT1 554 SXZ.QSIGIRQHLXRVQZ.R£LS£A£l'RQHREARPALLTSRZ./?F7FXPDG- -LRP/VNMDYVVGARTFRREKRAf RLTSRVK

EST2 401 NXi. ITPF JVEYFX - TYZ. VFNNVCRNHNSYTLS - NFNHSXM/?! IPXXSNNEFRI/A I PCRGADE EEFT - - IYXFNHKNAIQ 



Fig. 2A 



Motif A 

Euplotes 551 NSHLM/.KT£KW-/?MFKOPFGFA^FNYDDVMXKYEEfl'CmQVGQP-KLFFATMOIEKCWSVN/?EK/.STFL/mK.L55 
HT1 632 ALFSVLNrE/^--PGLLGAmGLO0IHR^ 
EST2 477 PTQKUEKZ./?A/K/?PTSFTKIY5PTQIADRIKETO 



Euplotes 629 0/^IMTAQILKRKA/NII^IDSKiVFRKKEMKOYFRQKFQKIALEGGQYPTLFSVLENEONDLNAK:<TLIVfA'K-CRA/ v «:C ! 

HT1 710 TYC W?RYAWQKAAHGHVRKAFKSHVS TLTDLQPYMRQFVAHLOETSPLRDAVVJfQSSSLWEASSG 

EST2 556 GFF W?SQY F FN - TWTGHKLFWV V N -A--SRVPKPYELYIDNVR7VHZ.SNQDWNVV-EM£im"- 

Motif B Motif C 

Euplotes 708 NLLQP^INICQYNYINFNS/fFKK(?TKfiJWLCVSSILSSFYKATtffSSLGfLRDESMNPENPNVNi/.M/?LTD0Y/./.J7 

HT1 777 LFOVFLRFMCHHAVRi R- G/fS /Vf?CQGJP(?6SIL5TU.CSZ.CrGDMf N - - - KLFAG I RRD SLLLRLWDFLIMT 

EST2 616 --ALW1/EDKCYIR EDGLF0GSSZ.SAPIVDZ.V)T)DafFYSEFK4SPSQD UIZ.KZ.AD0F/. I JS 

Motif D Motif E 

Euplotes 788 7"QENW V IFIIKL I N VSRf NGFKFA/MKKLQTSFPZ.SPSKFAKYGMDSVEEQN I l/QOYCDWIGISIDMKTLALMPNIA/LRi 
HT1 847 PHLTH4KTFLRT/.VRGVPFYGCVVA/LWCTVVNFFVEDEALGG-TAFVOMPAHGLFPWCGLLLDTRT/.f YQSDYSSYAR- - 
EST2 677 TOQQ-QKIN/K/fLAMS— -SFQKYNA/fANRDKILAVS SZ?SDDDT I' IQFCAMH I F VKELf l/WKHSSTMA' - - - 



Euplotes 868 EG/LCT/.A/L/VMQTCKASMWL^KLKSFLMNNirHYFRXrJTrEDFANKTLN/CLFISGGJ'KYMQCA/CEY- -KDHFKXWLAM 
HT1 924 TS/RAS/.TFtfRGF/(AGRNMRRXLFGVZ.RLKCHSLFLDLQVNSLQTVCTNIY/(ILLLQArRFHACVLQLPFHGQVW/(WP7-F 
EST2 741 MFHIRSJCSS #fGIFRS/.IALFNrRISY/a7DTMLMSTNTVLMQIDHVVKN I SECY/fSA- - FKOLS 



Euplotes 946 SSMIDLEVSKJJKSVTWFFKYLVCNI/^DTIFGEFHYPDFFZ.ST/.KHFIFIFSTKKYIFNRVCMJZ.KA/CfAK/.KSOQCQS 
HT1 1004 FLRVISDTASLCySILMKNAGMSLGMGMGPLPSEAVQWLC-HQAFLLKLTRHRVTYVPLLGSZ.RTAQTZ?Z.SRKLPGT 
EST2 808 NMQFHSF LQR//EMTVSG - - - -CPITKCOPLIEYf VR- -FTI --LNGFZ.fSZ.SSNTSKF-KDNI7Z.LR/CfI0HLQAYIYI 



Euplotes 1026 LJQYDA -- 

HT1 1083 TLTALEAAANPALPSDFKT I L0 
EST2 879 YIHIVN--- 
222 223
Y 5'-CCAGGTG|ggcctc gcaggtg | TCCTGCC - 3 ' 

1950 1952 

1 5 ' - AAAGAGG | GTGGCTG AACAGAA | GCCGAGC - 3 ' 

2130 2167 
a 5' -TGTCAAG|gtggatg cccccag|GACAGGC-3' 

2286 2468 
b 5"-GAGCCAC|gtctcta ggggcaa|GTCCTAC-3' 

2843 2844 

2 5' -ACTCCAG|GTGAGCG XXXXXXX|CTATGCC-3' 

3157 

3 5 ' - AACGCAG | CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTCGGGACAGCCAGAGATGG

T A A EENILVVTPAVLGSGQPEME 

AGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTG 
PPRRPSGVGSFPVSPGRGVG 

3158 

GGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG | GGATGTC - 3 ' 
L G L * 



Fig. 7C 



[Gel image: lanes a to l; size markers at 430, 400, 250 and 220 bp]
sequence "Y" 104-105 bases

GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAG
GlyLeuProGlyValGlyValArgLeuGlyLeuArgAlaAlaGlyGlyAsnGln
AlaSerProGlySerAlaSerGlyTrpGly * GlyArgProGlyGlyThrSer
ProProArgGlyArgArgProAlaGlyValGluGlyGlyArgGlyGluProAla 

CGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
ArgHisAlaGluSerSerAlaGlyAspSerGlyArgPheProArgArg 
AspMetArgArgAlaAlaGlnAlaThrGlnGlyAlaSerProAlaGly 
ThrCysGlyGluGlnArgArgArgLeuArgAlaLeuProProGlnVal
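
The three translation rows printed under each line of sequence "Y" correspond to its three forward reading frames. As a minimal sketch (the helper name `translate` and the one-letter output are illustrative assumptions, not part of the figure), the rows for the first printed line can be reproduced with the standard genetic code:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame=0):
    """Translate a DNA string in forward frame 0, 1 or 2; '*' marks a stop."""
    seq = seq.upper()
    # Only complete codons are translated; a trailing partial codon is dropped.
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First printed line of sequence "Y" (54 bases).
SEQ_Y_LINE1 = "GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAG"

for frame in range(3):
    print(frame + 1, translate(SEQ_Y_LINE1, frame))
# 1 GLPGVGVRLGLRAAGGNQ
# 2 ASPGSASGWG*GRPGGT
# 3 PPRGRRPAGVEGGRGEP
```

Only the second frame contains a stop codon within this line. In the figure, the last residue of the second and third rows is completed by bases from the next printed line, which this single-line sketch does not include.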

sequence "1" 38 bases 
GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAA 
ValAlaValLeuTrpPheAsnPheLeuPheAsnGlnLys



sequence "a" 36 bases 

GTGGATGTGACGGGCGCGTACGACACCATCCCCCAG 

ValAspValThrGlyAlaTyrAspThrIleProGln



sequence "b" 182 bases

GTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 
ValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTG 
GlnGluThrSerProLeuArgAspAlaValValIleGluGlnSerSerSerLeu

AATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCAC
AsnGluAlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHis

GCCGTGCGCATCAGGGGCAA
AlaValArgIleArgGlyLys



partial sequence "2" unknown length 

GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAG 
Ter 

GGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGT 
CAGATGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTG 
TGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA... 



Fig. 10A



sequence "3" 159 bases 
CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTCGGGACAGCCAGAG
AlaGluGluAsnIleSerValValThrProAlaValLeuGlySerGlyGlnProGlu

ATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGG 
MetGluProProArgArgProSerGlyValGlySerPheProValSerProGlyArg 

GGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
GlyValGlyLeuGlyLeu * 



sequence "X" unknown length 

...GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGA 
CCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGGCATGG 
GTGGACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGGGTGCCCTGAGCCCT 
CACTGAGTCGGTGGGGGCTTGTGGCTTCCCGTGAGCTTCCCCCTAGTCTGTTGTCTGG 
CTGAGCAAGCCTCCTGAGGGGCTCTCTATTG 



partial sequence of genomic intron (approximately 2.7 kb) 
GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAAGTGCGTTTGAGCCCCACATT
TGGTATCAGCTTAGATGAAGGGCCCGGAGGAGGGGCCACGGGACACAGCCAGGGCCAT 
GGCACGGCGCCCACCCATTTGTGCGCACAGTGAGGTGGCCGAGGTGCCGGTGCCTCCA 
GAAAAGCAGCGTGGGGGTGTAGGGGGAGCTCCTGGGGCAGGGAC. . . . 



Fig. 10B



Truncated telomerase 

ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
GFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P W M P G T P R R L P R L P Q R Y W Q

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
T W K M S V R D C A W L R R S P G V G C V P A A E H R L R E E I L A K F L H W L

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG

AAT- NNN- -GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGACCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGG 

Fig. 11 A 




GGTGGACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGGGTGCCCTGAGCCCTCACTGAGTCGGTGGGGGCTTGTGGCTTCCCGTGAGCTTCCCCCTAGTCTGTTGTCTG
GCTGAGCAAGCCTCCTGAGGGGCTCTCTATTG— 



Fig. 11B 



Truncated protein 1 

ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG



Fig. 11C 



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD



GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAA
VAVLWFNFLFNQK

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKR P S V S F R G * 



Fig. 11D



Truncated protein 2 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
HASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R R L V P P G L W G S R H N E R R F L R N T K K F I S L G K H A K L S L Q E L

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG



Fig. 11E



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA
VKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCAC 
KAAHGHVRKAFKSH 

GTCCTACGTCCAGTG 
V L R P V 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
P G D P A G L H P L H A A L Q P V L R R H G E Q A V C G D S A G R A A P A F G G

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 



Fig. 11F 



Reference protein 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAG
MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu 

GTGCTGCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 
ValLeuProLeuAlaThrPheValArgArgLeuGlyProGlnGlyTrpArgLeuValGln 

CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 
ArgGlyAspProAlaAlaPheArgAlaLeuValAlaGlnCysLeuValCysValProTrp

GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTG 
AspAlaArgProProProAlaAlaProSerPheArgGlnValSerCysLeuLysGluLeu 

GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 
ValAlaArgValLeuGlnArgLeuCysGluArgGlyAlaLysAsnValLeuAlaPheGly

TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 
PheAlaLeuLeuAspGlyAlaArgGlyGlyProProGluAlaPheThrThrSerValArg

AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 
SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 

TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 
LeuArgArgValGlyAspAspValLeuValHisLeuLeuAlaArgCysAlaLeuPheVal

CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 
LeuValAlaProSerCysAlaTyrGlnValCysGlyProProLeuTyrGlnLeuGlyAla

GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 
AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 

CGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGT 
ArgAlaTrpAsnHisSerValArgGluAlaGlyValProLeuGlyLeuProAlaProGly 

GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 
AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 

GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC 
GlyAlaAlaProGluProGluArgThrProValGlyGlnGlySerTrpAlaHisProGly

AGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 
ArgThrArgGlyProSerAspArgGlyPheCysValValSerProAlaArgProAlaGlu

GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 
GluAlaThrSerLeuGluGlyAlaLeuSerGlyThrArgHisSerHisProSerValGly 



Fig. 11G 
CGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 960

ArgGlnHisHisAlaGlyProProSerThrSerArgProProArgProTrpAspThrPro 320

TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1020

CysProProValTyrAlaGluThrLysHisPheLeuTyrSerSerGlyAspLysGluGln 340 

CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAlaArgArgLeu 360 

GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1140 

ValGluThrIlePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380

CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1200 

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400 

GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACC 1260 

AlaGlnCysProTyrGlyValLeuLeuLysThrHisCysProLeuArgAlaAlaValThr 420

CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1320 

ProAlaAlaGlyValCysAlaArgGluLysProGlnGlySerValAlaAlaProGluGlu 440

GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 

GluAspThrAspProArgArgLeuValGlnLeuLeuArgGlnHisSerSerProTrpGln 460 

GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 

ValTyrGlyPheValArgAlaCysLeuArgArgLeuValProProGlyLeuTrpGlySer 480 

AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500 

ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPheIleSerLeuGlyLysHis 500

GCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGGCTGCGCTTGGCTG 1560 

AlaLysLeuSerLeuGlnGluLeuThrTrpLysMetSerValArgAspCysAlaTrpLeu 520 

CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATC 1620 

ArgArgSerProGlyValGlyCysValProAlaAlaGluHisArgLeuArgGluGluIle 540

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1680 

LeuAlaLysPheLeuHisTrpLeuMetSerValTyrValValGluLeuLeuArgSerPhe 560 

TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1740 

PheTyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerVal 580 

TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1800 

TrpSerLysLeuGlnSerIleGlyIleArgGlnHisLeuLysArgValGlnLeuArgGlu 600

CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860 

LeuSerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArg 620



Fig. 11H 
CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1920

LeuArgPheIleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValVal 640

GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1980 

GlyAlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAla 660

CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 

LeuPheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerVal 680 

CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100 

LeuGlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGln 700

GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC 2160 

AspProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrIle 720

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220 

ProGlnAspArgLeuThrGluValIleAlaSerIleIleLysProGlnAsnThrTyrCys 740

GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280 

ValArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLys 760

AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340 

SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

GlnGluThrSerProLeuArgAspAlaValValIleGluGlnSerSerSerLeuAsnGlu 800

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

AlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHisAlaValArgIle 820

AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrValGlnCysGlnGlyIleProGlnGlySerIleLeuSerThrLeu 840

CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580 

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlyIleArgArgAsp 860

GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 

CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrValValAsnPheProValGluAspGluAlaLeuGlyGlyThrAlaPheVal 920

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 



Fig. 11I

Mar 22 2004

GAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880

GluValGlnSerAspTyrSerSerTyrAlaArgThrSerIleArgAlaSerLeuThrPhe 960

AACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAlaGlyArgAsnMetArgArgLysLeuPheGlyValLeuArgLeu 980 

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060 

IleTyrLysIleLeuLeuLeuGlnAlaTyrArgPheHisAlaCysValLeuGlnLeuPro 1020

TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 

PheHisGlnGlnValTrpLysAsnProThrPhePheLeuArgValIleSerAspThrAla 1040

TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180 

SerLeuCysTyrSerIleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060

GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 

AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080

AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 

LysLeuThrArgHisArgValThrTyrValProLeuLeuGlySerLeuArgThrAlaGln 1100 

ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 

ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1120

CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGAC 3420 

ProAlaLeuProSerAspPheLysThrIleLeuAsp 1132



Fig. 11J 



Truncated protein 3 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A L L D G A R

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA
S L R P S L T G A R R L V E T I F L G S R P W M P G T P R R L P R L P Q R Y W Q

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG



Fig. 11K



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K V D V T G A Y D T I P Q D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
AVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQC 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S I L S T L L C S L C Y G D M E N K L F A G I R R D G L L L R L V D

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
DFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFP

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S R



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 



TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA — 



Fig. 11L 



Altered C-terminus protein 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR 

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT
L R R L V P P G L W G S R H N E R R F L R N T K K F I S L G K H A K L S L Q E L

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G



Fig. 11M 



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
I R Q H L K R V Q L R E L S E A E V R Q H R E A R P A L L T S R L R F I P K P D

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA

V K V D V T G A Y D T I P Q D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA
Q G I P Q G S I L S T L L C S L C Y G D M E N K L F A G I R R D G L L L R L V D

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S S

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLD

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
LQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKN

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
PTFFLRVISDTASLCYSILKAKNAE



E E N I L V V T P A V L G S

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
G Q P E M E P P R R P S G V G S F P V S P G R G V G L G L *



Fig. 11N



Protein that lacks motif A 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG

M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 




Fig. 11O



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA

V K D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S I L S T L L C S L C Y G D M E N K L F A G I R R D G L L L R L V D

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S S

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 

Y A R T S I R A S L T F N R G F K A G R N M R R K L F G V L R L K C H S L F L D

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
LQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKN

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S D T A S L C Y S I L K A K N A G M S L G A K G A A G P L P S E

GGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCC 
A V Q W L C H Q A F L L K L T R H R V T Y V P L L G S L R T A Q T Q L S R K L P

GGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCC 
G T T L T A L E A A A N P A L P S D F K T I L D 

CTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCT
GAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCAC 
CAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAA 



Fig. 11P



GGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAA
AATACTGAATATATGAGTTTTTCAGTTTTGA 



Fig. 11Q



Truncated protein that lacks motif A 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
HASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C 

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG



Fig. 11R



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F 

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S I L S T L L C S L C Y G D M E N K L F A G I R R D G L L L R L V D

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
DFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFP

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG
V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S R



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 
* 

TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA 



Fig. 11S






Lacks motif A and altered C-terminus



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A L L D G A R 

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P W M P G T P R R L P R L P Q R Y W Q

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
L R R L V P P G L W G S R H N E R R F L R N T K K F I S L G K H A K L S L Q E L

GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT 
T W K M S V R D C A W L R R S P G V G C V P A A E H R L R E E I L A K F L H W L

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 



Fig. 11T 



AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K DRLTEVIASIIKPQNTYCVRRYAVVQ

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
Q G I P Q G S I L S T L L C S L C Y G D M E N K L F A G I R R D G L L L R L V D

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S S

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLD

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
LQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKN

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S D T A S L C Y S I L K A K N A E



E E N I L V V T P A V L G S

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
GQPEMEPPRRPSGVGSFPVSPGRGVGLGL*



Fig. 11U



Truncated telomerase (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A

GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A L L D G A R

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
HASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11V



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AAT- -NNN- -GACAGTCACCAGGGGGGTTGACCGCCGGACTGGGCGTCCCCAGGGTTGACTATAGGACCAGGTGTCCAGGTGCCCTGCAAGTAGAGGGGCTCTCAGAGGCGTCTGGCTGG 



CATGGGTGGACGTGGCCCCGGGCATGGCCTTCTGCGTGTGCTGCCGTGGGTGCCCTGAGCCCTCACTGAGTCGGTGGGGGCTTGTGGCTTCCCGTGAGCTTCCCCCTAGTCTGTTGTCTG 



GCTGAGCAAGCCTCCTGAGGGGCTCTCTATTG_ 



Fig. 11W 



Truncated protein 1 (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
P P R G R R P A G V E G G R G E P A T C G E Q R R R L R A L P P Q V

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11X 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD



GTGGCTGTGCTTTGGTTTAACTTCCTTTTTAACCAGAA 
V A V L W F N F L F N Q K

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKR P S V S P R G *



Fig. 11Y 



Truncated protein 2 (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GLPGVGVRLGLRAAGGNQRHAESSAGDSGRFPRR 
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
P S F R Q V S C L K E L V A R V L Q R L C E R G A K N V L A F G F A L L D G A R

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11Z



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 

GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE


GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
VKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ 

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCAC 
K A A H G H V R K A F K S H 

GTCCTACGTCCAGTG
V L R P V 

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
P G D P A G L H P L H A A L Q P V L R R H G E Q A V C G D S A G R A A P A F G G

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 



Fig. 11AA 



Reference protein (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAG 60 
MetProArgAlaProArgCysArgAlaValArgSerLeuLeuArgSerHisTyrArgGlu 20 

GTGCTGCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAG 120 
ValLeuProLeuAlaThrPheValArgArgLeuGlyProGlnGlyTrpArgLeuValGln 40

CGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGG 180 
ArgGlyAspProAlaAlaPheArgAlaLeuValAlaGlnCysLeuValCysValProTrp 60

GACGCACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTG 
AspAlaArgProProProAlaAlaProSerPheArgGlnVal 

GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAG 
GLPGVGVRLGLRAAGGNQRHAE
ASPGSASGWG*GRPGGTSDMRR 
PPRGRRPAGVEGGRGEPATCGE 

AGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
SSAGDSGRFPRR 
AAQATQGASPAG 
QRRRLRALPPQV 



TCCTGCCTGAAGGAGCTG 240 
SerCysLeuLysGluLeu 80 

GTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGC 300 

ValAlaArgValLeuGlnArgLeuCysGluArgGlyAlaLysAsnValLeuAlaPheGly 100

TTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGC 360 

PheAlaLeuLeuAspGlyAlaArgGlyGlyProProGluAlaPheThrThrSerValArg 120 

AGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTG 420 

SerTyrLeuProAsnThrValThrAspAlaLeuArgGlySerGlyAlaTrpGlyLeuLeu 140 

TTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTG 480 

LeuArgArgValGlyAspAspValLeuValHisLeuLeuAlaArgCysAlaLeuPheVal 160

CTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCT 540 

LeuValAlaProSerCysAlaTyrGlnValCysGlyProProLeuTyrGlnLeuGlyAla 180

GCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAA 600 

AlaThrGlnAlaArgProProProHisAlaSerGlyProArgArgArgLeuGlyCysGlu 200 




Fig. 11AB 



CGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGT 660

ArgAlaTrpAsnHisSerValArgGluAlaGlyValProLeuGlyLeuProAlaProGly 220

GCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGT 720 

AlaArgArgArgGlyGlySerAlaSerArgSerLeuProLeuProLysArgProArgArg 240 

GGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGC 780 

GlyAlaAlaProGluProGluArgThrProValGlyGlnGlySerTrpAlaHisProGly 260

AGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAA 840 

ArgThrArgGlyProSerAspArgGlyPheCysValValSerProAlaArgProAlaGlu 280 

GAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGC 900 

GluAlaThrSerLeuGluGlyAlaLeuSerGlyThrArgHisSerHisProSerValGly 300

CGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCCTGGGACACGCCT 960 

ArgGlnHisHisAlaGlyProProSerThrSerArgProProArgProTrpAspThrPro 320 

TGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAG 1020 

CysProProValTyrAlaGluThrLysHisPheLeuTyrSerSerGlyAspLysGluGln 340 

CTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTC 1080 

LeuArgProSerPheLeuLeuSerSerLeuArgProSerLeuThrGlyAlaArgArgLeu 360

GTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCC 1140 

ValGluThrIlePheLeuGlySerArgProTrpMetProGlyThrProArgArgLeuPro 380

CGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCAC 1200 

ArgLeuProGlnArgTyrTrpGlnMetArgProLeuPheLeuGluLeuLeuGlyAsnHis 400

GCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACC 1260 

AlaGlnCysProTyrGlyValLeuLeuLysThrHisCysProLeuArgAlaAlaValThr 420

CCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAG 1320 

ProAlaAlaGlyValCysAlaArgGluLysProGlnGlySerValAlaAlaProGluGlu 440

GAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAG 1380 

GluAspThrAspProArgArgLeuValGlnLeuLeuArgGlnHisSerSerProTrpGln 460

GTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCC 1440 

ValTyrGlyPheValArgAlaCysLeuArgArgLeuValProProGlyLeuTrpGlySer 480 

AGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCAT 1500 

ArgHisAsnGluArgArgPheLeuArgAsnThrLysLysPheIleSerLeuGlyLysHis 500

GCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTG 1560

AlaLysLeuSerLeuGlnGluLeuThrTrpLysMetSerValArgAspCysAlaTrpLeu 520



Fig. 11AC



CGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATC 1620

ArgArgSerProGlyValGlyCysValProAlaAlaGluHisArgLeuArgGluGluIle 540

CTGGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTC 1680 

LeuAlaLysPheLeuHisTrpLeuMetSerValTyrValValGluLeuLeuArgSerPhe 560

TTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTC 1740 

PheTyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerVal 580 

TGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAG 1800 

TrpSerLysLeuGlnSerIleGlyIleArgGlnHisLeuLysArgValGlnLeuArgGlu 600

CTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGA 1860 

LeuSerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArg 620 

CTCCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTG 1920 

LeuArgPheIleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValVal 640

GGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCA 1980 

GlyAlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAla 660

CTGTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTG 2040 

LeuPheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerVal 680

CTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAG 2100 

LeuGlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGln 700

GACCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATC 2160 

AspProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrIle 720

CCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGC 2220 

ProGlnAspArgLeuThrGluValIleAlaSerIleIleLysProGlnAsnThrTyrCys 740

GTGCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAG 2280 

ValArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLys 760

AGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTG 2340 

SerHisValSerThrLeuThrAspLeuGlnProTyrMetArgGlnPheValAlaHisLeu 780 

CAGGAGACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAG 2400 

GlnGluThrSerProLeuArgAspAlaValValIleGluGlnSerSerSerLeuAsnGlu 800

GCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 2460 

AlaSerSerGlyLeuPheAspValPheLeuArgPheMetCysHisHisAlaValArgIle 820

AGGGGCAAGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTG 2520 

ArgGlyLysSerTyrValGlnCysGlnGlyIleProGlnGlySerIleLeuSerThrLeu 840



Fig. 11AD



CTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGAC 2580

LeuCysSerLeuCysTyrGlyAspMetGluAsnLysLeuPheAlaGlyIleArgArgAsp 860

GGGCTGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCG 2640 

GlyLeuLeuLeuArgLeuValAspAspPheLeuLeuValThrProHisLeuThrHisAla 880 

AAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTG 2700 

LysThrPheLeuArgThrLeuValArgGlyValProGluTyrGlyCysValValAsnLeu 900 

CGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTT 2760 

ArgLysThrValValAsnPheProValGluAspGluAlaLeuGlyGlyThrAlaPheVal 920

CAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTG 2820 

GlnMetProAlaHisGlyLeuPheProTrpCysGlyLeuLeuLeuAspThrArgThrLeu 940 

GAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTC 2880 

GluValGlnSerAspTyrSerSerTyrAlaArgThrSerIleArgAlaSerLeuThrPhe 960

AACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTG 2940 

AsnArgGlyPheLysAlaGlyArgAsnMetArgArgLysLeuPheGlyValLeuArgLeu 980

AAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAAC 3000 

LysCysHisSerLeuPheLeuAspLeuGlnValAsnSerLeuGlnThrValCysThrAsn 1000 

ATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCA 3060 

IleTyrLysIleLeuLeuLeuGlnAlaTyrArgPheHisAlaCysValLeuGlnLeuPro 1020

TTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCC 3120 

PheHisGlnGlnValTrpLysAsnProThrPhePheLeuArgValIleSerAspThrAla 1040

TCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGC 3180 

SerLeuCysTyrSerIleLeuLysAlaLysAsnAlaGlyMetSerLeuGlyAlaLysGly 1060

GCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTC 3240 

AlaAlaGlyProLeuProSerGluAlaValGlnTrpLeuCysHisGlnAlaPheLeuLeu 1080 

AAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAG 3300 

LysLeuThrArgHisArgValThrTyrValProLeuLeuGlySerLeuArgThrAlaGln 1100

ACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAAC 3360 

ThrGlnLeuSerArgLysLeuProGlyThrThrLeuThrAlaLeuGluAlaAlaAlaAsn 1120 

CCGGCACTGCCCTCAGACTTCAAGACCATCCTGGAC 3420 

ProAlaLeuProSerAspPheLysThrIleLeuAsp 1132



Fig. 11AE 



Truncated protein 3 (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT
L R R L V P P G L W G S R H N E R R F L R N T K K F I S L G K H A K L S L Q E L



Fig. 11AF



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
I R Q H L K R V Q L R E L S E A E V R Q H R E A R P A L L T S R L R F I P K P D

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K V D V T G A Y D T I P Q D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
K A A H G H V R K A F K S H V S T L T D L Q P Y M R Q F V A H L Q E T S P L R D

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVD

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
DFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFP 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
VEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSR



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 
* 

TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA — 



Fig. 11AG



Altered C-terminus protein (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A 

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
S R S L P L P K R P R R G A A P E P E R T P V G Q G S W A H P G R T R G P S D R

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P W M P G T P R R L P R L P Q R Y W Q

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11AH



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
VKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQ

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD 

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVD

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
VEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSS

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
Y A R T S I R A S L T F N R G F K A G R N M R R K L F G V L R L K C H S L F L D

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L Q V N S L Q T V C T N I Y K I L L L Q A Y R F H A C V L Q L P F H Q Q V W K N

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
PTFFLRVISDTASLCYSILKAKNAE

CCGAAGAAAACATTTCTGTCGTGACTCCTGCGGTGCTTGGGTC 
E E N I L V V T P A V L G S 

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
G Q P E M E P P R R P S G V G S F P V S P G R G V G L G L *



Fig. 11AI 



Protein that lacks motif A (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG 
M P R A P R C R A V R S L L R S H Y R E V L P L A T F V

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
G L P G V G V R L G L R A A G G N Q R H A E S S A G D S G R F P R R
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P 

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
S L R P S L T G A R R L V E T I F L G S R P W M P G T P R R L P R L P Q R Y W Q

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11AJ 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
MSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
GLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYE

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA
V K D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA
K A A H G H V R K A F K S H V S T L T D L Q P Y M R Q F V A H L Q E T S P L R D

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVD

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S S

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 

Y A R T S I R A S L T F N R G F K A G R N M R R K L F G V L R L K C H S L F L D

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
LQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKN

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
P T F F L R V I S D T A S L C Y S I L K A K N A G M S L G A K G A A G P L P S E

GGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCC 
AVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLP

GGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCC
GTTLTALEAAANPALPSDFKTILD



Fig. 11AK



CTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCT
GAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCAC 
CAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAA 
GGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAA 
AATACTGAATATATGAGTTTTTCAGTTTTGA 



Fig. 11AL



Truncated protein that lacks motif A (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GLPGVGVRLGLRAAGGNQRHAESSAGDSGRFPRR 
A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC 
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
HASGPRRRLGCERAWNHSVREAGVPLGLPAPGARRRGGSA

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
G F C V V S P A R P A E E A T S L E G A L S G T R H S H P S V G R Q H H A G P P

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
S T S R P P R P W D T P C P P V Y A E T K H F L Y S S G D K E Q L R P S F L L S

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG
M R P L F L E L L G N H A Q C P Y G V L L K T H C P L R A A V T P A A G V C A R

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG
EKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRAC

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11AM 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD 

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 

V K D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q

GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
A V V I E Q S S S L N E A S S G L F D V F L R F M C H H A V R I R G K S Y V Q C

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVD

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC 
D F L L V T P H L T H A K T F L R T L V R G V P E Y G C V V N L R K T V V N F P

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 

V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S R



GTGAGCGCACCTGGCCGGAAGTGGAGCCTGTGCCCGGCTGGGGCAGGTGCTGCTGCAGGGCCGTTGCGTCCACCTCTGCTTCCGTGTGGGGCAGGCGACTGCCAATCCCAAAGGGTCAGA 
* 

TGCCACAGGGTGCCCCTCGTCCCATCTGGGGCTGAGCACAAATGCATCTTTCTGTGGGAGTGAGGGTGCCTCACAACGGGAGCAGTTTTCTGTGCTATTTTGGTAA 



Fig. 11 AN 



Lacks motif A and altered C-terminus (ver. 2) 



ATGCCGCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTG
MPRAPRCRAVRSLLRSHYREVLPLATFV 

CGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGC 
R R L G P Q G W R L V Q R G D P A A F R A L V A Q C L V C V P W D A R P P P A A



GGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGTG 
GLPGVGVRLGLRAAGGNQRHAESSAGDSGRFPRR 

A S P G S A S G W G * G R P G G T S D M R R A A Q A T Q G A S P A G
PPRGRRPAGVEGGRGEPATCGEQRRRLRALPPQV 

CCCCTCCTTCCGCCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAAGAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCG 
PSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALLDGAR

CGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGT 
G G P P E A F T T S V R S Y L P N T V T D A L R G S G A W G L L L R R V G D D V

GCTGGTTCACCTGCTGGCACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCC
L V H L L A R C A L F V L V A P S C A Y Q V C G P P L Y Q L G A A T Q A R P P P

ACACGCTAGTGGACCCCGAAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGC 
H A S G P R R R L G C E R A W N H S V R E A G V P L G L P A P G A R R R G G S A

CAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCG 
SRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDR

TGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCC 
GFCVVSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPP 

ATCCACATCGCGGCCACCACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAG 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLS

CTCTCTGAGGCCCAGCCTGACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGGGACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCA 
SLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQ

AATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCG 
MRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAR 

GGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTG 
E K P Q G S V A A P E E E D T D P R R L V Q L L R Q H S S P W Q V Y G F V R A C

CCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCT 
LRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL



Fig. 11 AO 



GACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGTTCCTGCACTGGCT
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWL

GATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGG 
M S V Y V V E L L R S F F Y V T E T T F Q K N R L F F Y R K S V W S K L Q S I G

AATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGA 
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPD

CGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCTCAACTACGA 
G L R P I V N M D Y V V G A R T F R R E K R A E R L T S R V K A L F S V L N Y E

GCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGCCTGAGCTGTACTT 
R A R R P G L L G A S V L G L D D I H R A W R T F V L R V R A Q D P P P E L Y F

TGTCAAG GACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGTATGCCGTGGTCCA 
V K D R L T E V I A S I I K P Q N T Y C V R R Y A V V Q



GAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCTCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAGACCAGCCCGCTGAGGGA 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRD

TGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGTCCTACGTCCAGTG 
AVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQC

CCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCCTGCGTTTGGTGGA 
QGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVD

TGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCC
DFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFP 

TGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAG 
V E D E A L G G T A F V Q M P A H G L F P W C G L L L D T R T L E V Q S D Y S S

CTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGA 
YARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLD

TTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAA 
L Q V N S L Q T V C T N I Y K I L L L Q A Y R F H A C V L Q L P F H Q Q V W K N

CCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGA 
PTFFLRVISDTASLCYSILKAKNAE



EENILVVTPAVLGS 

GGGACAGCCAGAGATGGAGCCACCCCGCAGACCGTCGGGTGTGGGCAGCTTTCCGGTGTCTCCTGGGAGGGGAGTTGGGCTGGGCCTGTGACTCCTCAGCCTCTGTTTTCCCCCAG 
GQPEMEPPRRPSGVGSFPVSPGRGVGLGL*
LOCUS pAKI28.4 7615 bp dsDNA Circular

DEFINITION Human telomerase clone with exon beta spliced out 



1 tcgacctgca ggcatgcaag cttggcactg gccgtcgttt tacaacgtcg tgactgggaa 
61 aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 
121 aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 
181 tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcataaat 
241 tccctgtttt ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag 
301 aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc 
361 catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc 
421 gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct 
481 ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag 
541 cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa 
601 ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac 
661 aaactcttcc tgtcgtcata tctacaagcc atccccccac agatacggta aactagcctc 
721 gtttttgcat caggaaagca gggaatttat ggtgcactct cagtacaatc tgctctgatg 
781 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 
841 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 
901 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 
961 ttttataggt taatgtcatg ataataatgg tttcttagac gtgaggttct gtacccgaca 
1021 ccatcgaatg gtgcaaaacc tttcgcggta tggcatgata gcgcccggaa gagagtcaat 
1081 tcagggtggt gaatgtgaaa ccagtaacgt tatacgatgt cgcagagtat gccggtgtct 
1141 cttatcagac cgtttcccgc gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg 
1201 aaaaagtgga agcggcgatg gcggagctga attacattcc caaccgcgtg gcacaacaac 
1261 tggcgggcaa acagtcgttg ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc 
1321 cgtcgcaaat tgtcgcggcg attaaatctc gcgccgatca actgggtgcc agcgtggtgg 
1381 tgtcgatggt agaacgaagc ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg 
1441 cgcaacgcgt cagtgggctg atcattaact atccgctgga tgaccaggat gccattgctg 
1501 tggaagctgc ctgcactaat gttccggcgt tatttcttga tgtctctgac cagacaccca 
1561 tcaacagtat tattttctcc catgaagacg gtacgcgact gggcgtggag catctggtcg 
1621 cattgggtca ccagcaaatc gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc 
1681 tgcgtctggc tggctggcat aaatatctca ctcgcaatca aattcagccg atagcggaac 
1741 gggaaggcga ctggagtgcc atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg 
1801 gcatcgttcc cactgcgatg ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg 
1861 ccattaccga gtccgggctg cgcgttggtg cggatatctc ggtagtggga tacgacgata 
1921 ccgaagacag ctcatgttat atcccgccgt taaccaccat caaacaggat tttcgcctgc 
1981 tggggcaaac cagcgtggac cgcttgctgc aactctctca gggccaggcg gtgaagggca 
2041 atcagctgtt gcccgtctca ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa 
2101 ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 
2161 tggaaagcgg gcagtgagcg caacgcaatt aatgtaagtt agctcactca ttaggcaccc 
2221 caggctttac actttatgct tccgacctgc aagaacctca cgtcaggtgg cacttttcgg 
2281 ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg 
2341 ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt 
2401 attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 
2461 gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg 
2521 ggttacatcg agaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 
2581 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 
2641 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 
2701 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca
2761 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 
2821 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 
2881 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 
2941 tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 
3001 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 
3061 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 
3121 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 
3181 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 
3241 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 
3301 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 
3361 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 
3421 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 
3481 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 
3541 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 
3601 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 
3661 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 
3721 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 
3781 gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc 
3841 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 
3901 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 
3961 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 
4021 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 
4081 ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 
4141 ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 
4201 gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agaattaatt 
4261 ctcatgtttg acagcttatc atcgactgca cggtgcacca atgcttctgg cgtcaggcag 
4321 ccatcggaag ctgtggtatg gctgtgcagg tcgtaaatca ctgcataatt cgtgtcgctc 
4381 aaggcgcact cccgttctgg ataatgtttt ttgcgccgac atcataacgg ttctggcaaa 
4441 tattctgaaa tgagctgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag 
4501 cggataacaa tttcacacag gaaacagcga tgaattcaga tctcaccatg aaggagctgg 
4561 tggcccgagt gctgcagagg ctgtgcgagc gcggcgcgaa gaacgtgctg gccttcggct 
4621 tcgcgctgct ggacggggcc cgcgggggcc cccccgaggc cttcaccacc agcgtgcgca 
4681 gctacctgcc caacacggtg accgacgcac tgcgggggag cggggcgtgg gggctgctgc 
4741 tgcgccgcgt gggcgacgac gtgctggttc acctgctggc acgctgcgcg ctctttgtgc 
4801 tggtggctcc cagctgcgcc taccaggtgt gcgggccgcc gctgtaccag ctcggcgctg 
4861 ccactcaggc ccggcccccg ccacacgcta gtggaccccg aaggcgtctg ggatgcgaac 
4921 gggcctggaa ccatagcgtc agggaggccg gggtccccct gggcctgcca gccccgggtg 
4981 cgaggaggcg cgggggcagt gccagccgaa gtctgccgtt gcccaagagg cccaggcgtg 
5041 gcgctgcccc tgagccggag cggacgcccg ttgggcaggg gtcctgggcc cacccgggca 
5101 ggacgcgtgg accgagtgac cgtggtttct gtgtggtgtc acctgccaga cccgccgaag 
5161 aagccacctc tttggagggt gcgctctctg gcacgcgcca ctcccaccca tccgtgggcc 
5221 gccagcacca cgcgggcccc ccatccacat cgcggccacc acgtccctgg gacacgcctt 
5281 gtcccccggt gtacgccgag accaagcact tcctctactc ctcaggcgac aaggagcagc 
5341 tgcggccctc cttcctactc agctctctga ggcccagcct gactggcgct cggaggctcg 
5401 tggagaccat ctttctgggt tccaggccct ggatgccagg gactccccgc aggttgcccc 
5461 gcctgcccca gcgctactgg caaatgcggc ccctgtttct ggagctgctt gggaaccacg 
5521 cgcagtgccc ctacggggtg ctcctcaaga cgcactgccc gctgcgagct gcggtcaccc 

Fig. 



5581 cagcagccgg tgtctgtgcc cgggagaagc cccagggctc tgtggcggcc cccgaggagg
5641 aggacacaga cccccgtcgc ctggtgcagc tgctccgcca gcacagcagc ccctggcagg
5701 tgtacggctt cgtgcgggcc tgcctgcgcc ggctggtgcc cccaggcctc tggggctcca
5761 ggcacaacga acgccgcttc ctcaggaaca ccaagaagtt catctccctg gggaagcatg
5821 ccaagctctc gctgcaggag ctgacgtgga agatgagcgt gcgggactgc gcttggctgc
5881 gcaggagccc aggggttggc tgtgttccgg ccgcagagca ccgtctgcgt gaggagatcc
5941 tggccaagtt cctgcactgg ctgatgagtg tgtacgtcgt cgagctgctc aggtctttct
6001 tttatgtcac ggagaccacg tttcaaaaga acaggctctt tttctaccgg aagagtgtct
6061 ggagcaagtt gcaaagcatt ggaatcagac agcacttgaa gagggtgcag ctgcgggagc
6121 tgtcggaagc agaggtcagg cagcatcggg aagccaggcc cgccctgctg acgtccagac
6181 tccgcttcat ccccaagcct gacgggctgc ggccgattgt gaacatggac tacgtcgtgg
6241 gagccagaac gttccgcaga gaaaagaggg ccgagcgtct cacctcgagg gtgaaggcac
6301 tgttcagcgt gctcaactac gagcgggcgc ggcgccccgg cctcctgggc gcctctgtgc
6361 tgggcctgga cgatatccac agggcctggc gcaccttcgt gctgcgtgtg cgggcccagg
6421 acccgccgcc tgagctgtac tttgtcaagg tggatgtgac gggcgcgtac gacaccatcc
6481 cccaggacag gctcacggag gtcatcgcca gcatcatcaa accccagaac acgtactgcg
6541 tgcgtcggta tgccgtggtc cagaaggccg cccatgggca cgtccgcaag gccttcaaga
6601 gccacgtcct acgtccagtg ccaggggatc ccgcagggct ccatcctctc cacgctgctc
6661 tgcagcctgt gctacggcga catggagaac aagctgtttg cggggattcg gcgggacggg
6721 ctgctcctgc gtttggtgga tgatttcttg ttggtgacac ctcacctcac ccacgcgaaa
6781 acttcctcag gacctggtcc gaagtgtcct gagtatggct gcgtggtgaa cttgcggaag
6841 acagtggtga acttccctgt agaagacgaa gccctgggtg gcacggcttt tgttcagatg
6901 ccggcccacg gcctattccc ctggtgcggc ctgctgctgg atacccggac cctggaggtg
6961 cagagcgact actccagcta tgcccggacc tccatcagag ccagtctcac cttcaaccgc
7021 ggcttcaagg ctgggaggaa catgcgtcgc aaactctttg gggtcttgcg gctgaagtgt
7081 cacagcctgt ttctggattt gcaggtgaac agcctccaga cggtgtgcac caacatctac
7141 aagatcctcc tgctgcaggc gtacaggttt cacgcatgtg tgctgcagct cccatttcat
7201 cagcaagttt ggaagaaccc cacatttttc ctgcgcgtca tctctgacac ggcctccctc
7261 tgctactcca tcctgaaagc caagaacgca gccgaagaaa acatttctgt cgtgactcct
7321 gcggtgcttg ggtcgggaca gccagagatg gagccacccc gcagaccgtc gggtgtgggc
7381 agctttccgg tgtctcctgg gaggggagtt gggctgggcc tgtgactcct cagcctctgt
7441 tttcccccag ggatgtcgct gggggccaag ggcgccgccg gccctctgcc ctccgaggcc
7501 gtgcagtggc tgtgccacca agcattcctg ctcaagctga ctcgacaccg tgtcacctac
7561 gtgccactcc tggggtcact caggacaggc aagtgtgggt ggaggccagt gcggg
LOCUS pAKI28.7 7797 bp dsDNA Circular

DEFINITION Human telomerase clone with alternative C-terminus 



1 tcgacctgca ggcatgcaag cttggcactg gccgtcgttt tacaacgtcg tgactgggaa 
61 aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 
121 aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 
181 tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcataaat 
241 tccctgtttt ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag 
301 aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc 
361 catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc 
421 gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct 
481 ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag 
541 cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa 
601 ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac 
661 aaactcttcc tgtcgtcata tctacaagcc atccccccac agatacggta aactagcctc 
721 gtttttgcat caggaaagca gggaatttat ggtgcactct cagtacaatc tgctctgatg 
781 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 
841 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 
901 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 
961 ttttataggt taatgtcatg ataataatgg tttcttagac gtgaggttct gtacccgaca 
1021 ccatcgaatg gtgcaaaacc tttcgcggta tggcatgata gcgcccggaa gagagtcaat 
1081 tcagggtggt gaatgtgaaa ccagtaacgt tatacgatgt cgcagagtat gccggtgtct 
1141 cttatcagac cgtttcccgc gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg 
1201 aaaaagtgga agcggcgatg gcggagctga attacattcc caaccgcgtg gcacaacaac 
1261 tggcgggcaa acagtcgttg ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc 
1321 cgtcgcaaat tgtcgcggcg attaaatctc gcgccgatca actgggtgcc agcgtggtgg 
1381 tgtcgatggt agaacgaagc ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg 
1441 cgcaacgcgt cagtgggctg atcattaact atccgctgga tgaccaggat gccattgctg 
1501 tggaagctgc ctgcactaat gttccggcgt tatttcttga tgtctctgac cagacaccca 
1561 tcaacagtat tattttctcc catgaagacg gtacgcgact gggcgtggag catctggtcg 
1621 cattgggtca ccagcaaatc gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc 
1681 tgcgtctggc tggctggcat aaatatctca ctcgcaatca aattcagccg atagcggaac 
1741 gggaaggcga ctggagtgcc atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg 
1801 gcatcgttcc cactgcgatg ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg 
1861 ccattaccga gtccgggctg cgcgttggtg cggatatctc ggtagtggga tacgacgata 
1921 ccgaagacag ctcatgttat atcccgccgt taaccaccat caaacaggat tttcgcctgc 
1981 tggggcaaac cagcgtggac cgcttgctgc aactctctca gggccaggcg gtgaagggca 
2041 atcagctgtt gcccgtctca ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa 
2101 ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 
2161 tggaaagcgg gcagtgagcg caacgcaatt aatgtaagtt agctcactca ttaggcaccc 
2221 caggctttac actttatgct tccgacctgc aagaacctca cgtcaggtgg cacttttcgg 
2281 ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg 
2341 ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt
2401 attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 
2461 gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg 
2521 ggttacatcg agaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 
2581 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 
2641 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 
2701 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca
2761 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 
2821 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 
2881 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 
2941 tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 
3001 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 
3061 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 
3121 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 
3181 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 
3241 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 
3301 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 
3361 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 
3421 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 
3481 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 
3541 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 
3601 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 
3661 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 
3721 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 
3781 gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc 
3841 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 
3901 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 
3961 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 
4021 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 
4081 ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 
4141 ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 
4201 gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agaattaatt 
4261 ctcatgtttg acagcttatc atcgactgca cggtgcacca atgcttctgg cgtcaggcag 
4321 ccatcggaag ctgtggtatg gctgtgcagg tcgtaaatca ctgcataatt cgtgtcgctc 
4381 aaggcgcact cccgttctgg ataatgtttt ttgcgccgac atcataacgg ttctggcaaa 
4441 tattctgaaa tgagctgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag 
4501 cggataacaa tttcacacag gaaacagcga tgaattcaga tctcaccatg aaggagctgg 
4561 tggcccgagt gctgcagagg ctgtgcgagc gcggcgcgaa gaacgtgctg gccttcggct 
4621 tcgcgctgct ggacggggcc cgcgggggcc cccccgaggc cttcaccacc agcgtgcgca 
4681 gctacctgcc caacacggtg accgacgcac tgcgggggag cggggcgtgg gggctgctgc 
4741 tgcgccgcgt gggcgacgac gtgctggttc acctgctggc acgctgcgcg ctctttgtgc 
4801 tggtggctcc cagctgcgcc taccaggtgt gcgggccgcc gctgtaccag ctcggcgctg 
4861 ccactcaggc ccggcccccg ccacacgcta gtggaccccg aaggcgtctg ggatgcgaac 
4921 gggcctggaa ccatagcgtc agggaggccg gggtccccct gggcctgcca gccccgggtg 
4981 cgaggaggcg cgggggcagt gccagccgaa gtctgccgtt gcccaagagg cccaggcgtg 
5041 gcgctgcccc tgagccggag cggacgcccg ttgggcaggg gtcctgggcc cacccgggca 
5101 ggacgcgtgg accgagtgac cgtggtttct gtgtggtgtc acctgccaga cccgccgaag 
5161 aagccacctc tttggagggt gcgctctctg gcacgcgcca ctcccaccca tccgtgggcc 
5221 gccagcacca cgcgggcccc ccatccacat cgcggccacc acgtccctgg gacacgcctt 
5281 gtcccccggt gtacgccgag accaagcact tcctctactc ctcaggcgac aaggagcagc 
5341 tgcggccctc cttcctactc agctctctga ggcccagcct gactggcgct cggaggctcg 
5401 tggagaccat ctttctgggt tccaggccct ggatgccagg gactccccgc aggttgcccc 
5461 gcctgcccca gcgctactgg caaatgcggc ccctgtttct ggagctgctt gggaaccacg 
5521 cgcagtgccc ctacggggtg ctcctcaaga cgcactgccc gctgcgagct gcggtcaccc 
5581 cagcagccgg tgtctgtgcc cgggagaagc cccagggctc tgtggcggcc cccgaggagg
5641 aggacacaga cccccgtcgc ctggtgcagc tgctccgcca gcacagcagc ccctggcagg 
5701 tgtacggctt cgtgcgggcc tgcctgcgcc ggctggtgcc cccaggcctc tggggctcca 
5761 ggcacaacga acgccgcttc ctcaggaaca ccaagaagtt catctccctg gggaagcatg 
5821 ccaagctctc gctgcaggag ctgacgtgga agatgagcgt gcgggactgc gcttggctgc 
5881 gcaggagccc aggggttggc tgtgttccgg ccgcagagca ccgtctgcgt gaggagatcc 
5941 tggccaagtt cctgcactgg ctgatgagtg tgtacgtcgt cgagctgctc aggtctttct 
6001 tttatgtcac ggagaccacg tttcaaaaga acaggctctt tttctaccgg aagagtgtct 
6061 ggagcaagtt gcaaagcatt ggaatcagac agcacttgaa gagggtgcag ctgcgggagc 
6121 tgtcggaagc agaggtcagg cagcatcggg aagccaggcc cgccctgctg acgtccagac 
6181 tccgcttcat ccccaagcct gacgggctgc ggccgattgt gaacatggac tacgtcgtgg 
6241 gagccagaac gttccgcaga gaaaagaggg ccgagcgtct cacctcgagg gtgaaggcac 
6301 tgttcagcgt gctcaactac gagcgggcgc ggcgccccgg cctcctgggc gcctctgtgc 
6361 tgggcctgga cgatatccac agggcctggc gcaccttcgt gctgcgtgtg cgggcccagg 
6421 acccgccgcc tgagctgtac tttgtcaagg tggatgtgac gggcgcgtac gacaccatcc 
6481 cccaggacag gctcacggag gtcatcgcca gcatcatcaa accccagaac acgtactgcg 
6541 tgcgtcggta tgccgtggtc cagaaggccg cccatgggca cgtccgcaag gccttcaaga 
6601 gccacgtctc taccttgaca gacctccagc cgtacatgcg acagttcgtg gctcacctgc 
6661 aggagaccag cccgctgagg gatgccgtcg tcatcgagca gagctcctcc ctgaatgagg 
6721 ccagcagtgg cctcttcgac gtcttcctac gcttcatgtg ccaccacgcc gtgcgcatca 
6781 ggggcaagtc ctacgtccag tgccagggga tcccgcaggg ctccatcctc tccacgctgc 
6841 tctgcagcct gtgctacggc gacatggaga acaagctgtt tgcggggatt cggcgggacg 
6901 ggctgctcct gcgtttggtg gatgatttct tgttggtgac acctcacctc acccacgcga 
6961 aaacttcctc aggacctggt ccgaagtgtc ctgagtatgg ctgcgtggtg aacttgcgga 
7021 agacagtggt gaacttccct gtagaagacg aagccctggg tggcacggct tttgttcaga 
7081 tgccggccca cggcctattc ccctggtgcg gcctgctgct ggatacccgg accctggagg 
7141 tgcagagcga ctactccagc tatgcccgga cctccatcag agccagtctc accttcaacc 
7201 gcggcttcaa ggctgggagg aacatgcgtc gcaaactctt tggggtcttg cggctgaagt 
7261 gtcacagcct gtttctggat ttgcaggtga acagcctcca gacggtgtgc accaacatct 
7321 acaagatcct cctgctgcag gcgtacaggt ttcacgcatg tgtgctgcag ctcccatttc 
7381 atcagcaagt ttggaagaac cccacatttt tcctgcgcgt catctctgac acggcctccc 
7441 tctgctactc catcctgaaa gccaagaacg cagccgaaga aaacatttct gtcgtgactc 
7501 ctgcggtgct tgggtcggga cagccagaga tggagccacc ccgcagaccg tcgggtgtgg 
7561 gcagctttcc ggtgtctcct gggaggggag ttgggctggg cctgtgactc ctcagcctct 
7621 gttttccccc agggatgtcg ctgggggcca agggcgccgc cggccctctg ccctccgagg 
7681 ccgtgcagtg gctgtgccac caagcattcc tgctcaagct gactcgacac cgtgtcacct 
7741 acgtgccact cctggggtca ctcaggacag gcaagtgtgg gtggaggcca gtgcggg 
LOCUS pAKI28.14 7688 bp dsDNA Circular

DEFINITION Human telomerase clone with exon alpha spliced out 

1 tcgacctgca ggcatgcaag cttggcactg gccgtcgttt tacaacgtcg tgactgggaa 
61 aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt 
121 aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa 
181 tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcataaat 
241 tccctgtttt ggcggatgag agaagatttt cagcctgata cagattaaat cagaacgcag 
301 aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc cacctgaccc 
361 catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt ctccccatgc 
421 gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct 
481 ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat ccgccgggag 
541 cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc ccgccataaa 
601 ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt gcgtttctac 
661 aaactcttcc tgtcgtcata tctacaagcc atccccccac agatacggta aactagcctc 
721 gtttttgcat caggaaagca gggaatttat ggtgcactct cagtacaatc tgctctgatg 
781 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 
841 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 
901 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 
961 ttttataggt taatgtcatg ataataatgg tttcttagac gtgaggttct gtacccgaca 
1021 ccatcgaatg gtgcaaaacc tttcgcggta tggcatgata gcgcccggaa gagagtcaat 
1081 tcagggtggt gaatgtgaaa ccagtaacgt tatacgatgt cgcagagtat gccggtgtct 
1141 cttatcagac cgtttcccgc gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg 
1201 aaaaagtgga agcggcgatg gcggagctga attacattcc caaccgcgtg gcacaacaac 
1261 tggcgggcaa acagtcgttg ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc 
1321 cgtcgcaaat tgtcgcggcg attaaatctc gcgccgatca actgggtgcc agcgtggtgg 
1381 tgtcgatggt agaacgaagc ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg 
1441 cgcaacgcgt cagtgggctg atcattaact atccgctgga tgaccaggat gccattgctg 
1501 tggaagctgc ctgcactaat gttccggcgt tatttcttga tgtctctgac cagacaccca 
1561 tcaacagtat tattttctcc catgaagacg gtacgcgact gggcgtggag catctggtcg 
1621 cattgggtca ccagcaaatc gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc 
1681 tgcgtctggc tggctggcat aaatatctca ctcgcaatca aattcagccg atagcggaac 
1741 gggaaggcga ctggagtgcc atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg 
1801 gcatcgttcc cactgcgatg ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg 
1861 ccattaccga gtccgggctg cgcgttggtg cggatatctc ggtagtggga tacgacgata 
1921 ccgaagacag ctcatgttat atcccgccgt taaccaccat caaacaggat tttcgcctgc 
1981 tggggcaaac cagcgtggac cgcttgctgc aactctctca gggccaggcg gtgaagggca 
2041 atcagctgtt gcccgtctca ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa 
2101 ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 
2161 tggaaagcgg gcagtgagcg caacgcaatt aatgtaagtt agctcactca ttaggcaccc 
2221 caggctttac actttatgct tccgacctgc aagaacctca cgtcaggtgg cacttttcgg 
2281 ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa tatgtatccg 
2341 ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa gagtatgagt 
2401 attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct tcctgttttt 
2461 gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg 
2521 ggttacatcg agaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 
2581 aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 
2641 ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 
2701 agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 
2761 gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 
2821 gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 
2881 gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg 
2941 tagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc 
3001 ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg 
3061 cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg 
3121 gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga 
3181 cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac 
3241 tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa 
3301 aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca 
3361 aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag 
3421 gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac 
3481 cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa 
3541 ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc 
3601 accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag 
3661 tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac 
3721 cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc 
3781 gaacgaccta caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc 
3841 ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca 
3901 cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc 
3961 tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg 
4021 ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct 
4081 ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata 
4141 ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc 
4201 gcccaatacg caaaccgcct ctccccgcgc gttggccgat tcattaatgc agaattaatt 
4261 ctcatgtttg acagcttatc atcgactgca cggtgcacca atgcttctgg cgtcaggcag 
4321 ccatcggaag ctgtggtatg gctgtgcagg tcgtaaatca ctgcataatt cgtgtcgctc 
4381 aaggcgcact cccgttctgg ataatgtttt ttgcgccgac atcataacgg ttctggcaaa 
4441 tattctgaaa tgagctgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag 
4501 cggataacaa tttcacacag gaaacagcga tgaattcaga tctcaccatg aaggagctgg 
4561 tggcccgagt gctgcagagg ctgtgcgagc gcggcgcgaa gaacgtgctg gccttcggct 
4621 tcgcgctgct ggacggggcc cgcgggggcc cccccgaggc cttcaccacc agcgtgcgca 
4681 gctacctgcc caacacggtg accgacgcac tgcgggggag cggggcgtgg gggctgctgc 
4741 tgcgccgcgt gggcgacgac gtgctggttc acctgctggc acgctgcgcg ctctttgtgc 
4801 tggtggctcc cagctgcgcc taccaggtgt gcgggccgcc gctgtaccag ctcggcgctg 
4861 ccactcaggc ccggcccccg ccacacgcta gtggaccccg aaggcgtctg ggatgcgaac 
4921 gggcctggaa ccatagcgtc agggaggccg gggtccccct gggcctgcca gccccgggtg 
4981 cgaggaggcg cgggggcagt gccagccgaa gtctgccgtt gcccaagagg cccaggcgtg 
5041 gcgctgcccc tgagccggag cggacgcccg ttgggcaggg gtcctgggcc cacccgggca 
5101 ggacgcgtgg accgagtgac cgtggtttct gtgtggtgtc acctgccaga cccgccgaag 
5161 aagccacctc tttggagggt gcgctctctg gcacgcgcca ctcccaccca tccgtgggcc 
5221 gccagcacca cgcgggcccc ccatccacat cgcggccacc acgtccctgg gacacgcctt 
5281 gtcccccggt gtacgccgag accaagcact tcctctactc ctcaggcgac aaggagcagc 
5341 tgcggccctc cttcctactc agctctctga ggcccagcct gactggcgct cggaggctcg 
5401 tggagaccat ctttctgggt tccaggccct ggatgccagg gactccccgc aggttgcccc 
5461 gcctgcccca gcgctactgg caaatgcggc ccctgtttct ggagctgctt gggaaccacg 
5521 cgcagtgccc ctacggggtg ctcctcaaga cgcactgccc gctgcgagct gcggtcaccc 
5581 cagcagccgg tgtctgtgcc cgggagaagc cccagggctc tgtggcggcc cccgaggagg 
5641 aggacacaga cccccgtcgc ctggtgcagc tgctccgcca gcacagcagc ccctggcagg 
5701 tgtacggctt cgtgcgggcc tgcctgcgcc ggctggtgcc cccaggcctc tggggctcca 
5761 ggcacaacga acgccgcttc ctcaggaaca ccaagaagtt catctccctg gggaagcatg 
5821 ccaagctctc gctgcaggag ctgacgtgga agatgagcgt gcgggactgc gcttggctgc 
5881 gcaggagccc aggggttggc tgtgttccgg ccgcagagca ccgtctgcgt gaggagatcc 
5941 tggccaagtt cctgcactgg ctgatgagtg tgtacgtcgt cgagctgctc aggtctttct 
6001 tttatgtcac ggagaccacg tttcaaaaga acaggctctt tttctaccgg aagagtgtct 
6061 ggagcaagtt gcaaagcatt ggaatcagac agcacttgaa gagggtgcag ctgcgggagc 
6121 tgtcggaagc agaggtcagg cagcatcggg aagccaggcc cgccctgctg acgtccagac 
6181 tccgcttcat ccccaagcct gacgggctgc ggccgattgt gaacatggac tacgtcgtgg 
6241 gagccagaac gttccgcaga gaaaagaggg ccgagcgtct cacctcgagg gtgaaggcac 
6301 tgttcagcgt gctcaactac gagcgggcgc ggcgccccgg cctcctgggc gcctctgtgc 
6361 tgggcctgga cgatatccac agggcctggc gcaccttcgt gctgcgtgtg cgggcccagg 
6421 acccgccgcc tgagctgtac tttgtcaagg acaggctcac ggaggtcatc gccagcatca 
6481 tcaaacccag aacacgtact gcgtgcgtcg gtatgccgtg gtccagaagg ccgcccatgg 
6541 gcacgtccgc aaggccttca agagccacgt ctctaccttg acagacctcc agccgtacat 
6601 gcgacagttc gtggctcacc tgcaggagac cagcccgctg agggatgccg tcgtcatcga 
6661 gcagagctcc tccctgaatg aggccagcag tggcctcttc gacgtcttcc tacgcttcat 
6721 gtgccaccac gccgtgcgca tcaggggcaa gtcctacgtc cagtgccagg ggatcccgca 
6781 gggctccatc ctctccacgc tgctctgcag cctgtgctac ggcgacatgg agaacaagct 
6841 gtttgcgggg attcggcggg acgggctgct cctgcgtttg gtggatgatt tcttgttggt 
6901 gacacctcac ctcacccacg cgaaaacctt cctcaggacc ctggtccgag gtgtccctga 
6961 gtatggctgc gtggtgaact tgcggaagac agtggtgaac ttccctgtag aagacgaggc 
7021 cctgggtggc acggcttttg ttcagatgcc ggcccacggc ctattcccct ggtgcggcct 
7081 gctgctggat acccggaccc tggaggtgca gagcgactac tccagctatg cccggacctc 
7141 catcagagcc agtctcacct tcaaccgcgg cttcaaggct gggaggaaca tgcgtcgcaa 
7201 actctttggg gtcttgcggc tgaagtgtca cagcctgttt ctggatttgc aggtgaacag 
7261 cctccagacg gtgtgcacca acatctacaa gatcctcctg ctgcaggcgt acaggtttca 
7321 cgcatgtgtg ctgcagctcc catttcatca gcaagtttgg aagaacccca catttttcct 
7381 gcgcgtcatc tctgacacgg cctccctctg ctactccatc ctgaaagcca agaacgcagg 
7441 gatgtcgctg ggggccaagg gcgccgccgg ccctctgccc tccgaggccg tgcagtggct 
7501 gtgccaccaa gcattcctgc tcaagctgac tcgacaccgt gtcacctacg tgccactcct 
7561 ggggtcactc aggacagccc agacgcagct gagtcggaag ctcccgggga cgacgctgac 
7621 tgccctggag gccgcagcca acccggcact gccctcagac ttcaagacca tcctggactg 
7681 atctagag 